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ABSTRACT 

The stochasticity in the distribution of dark haloes in the cosmic density field is 
reflected in the distribution function Pv(Nf l \S m ) which gives the probability of finding 
Nh haloes in a volume V with mass density contrast S m . We study the properties of 
this function using high-resolution TV-body simulations, and find that Pv(N n \5 m ) is 
significantly non-Poisson. The ratio between the variance and the mean goes from ~ 1 
(Poisson) at 1 + 6 m <C 1 to < 1 (sub-Poisson) at 1 + 6 m ~ 1 to > 1 (super-Poisson) at 
1 + 5 m 3> 1. The mean bias relation is found to be well described by halo bias models 
based on the Press-Schechter formalism. The sub-Poisson variance can be explained 
as a result of halo-exclusion while the super-Poisson variance at high S m may be 
explained as a result of halo clustering. A simple phenomenological model is proposed 
to describe the behavior of the variance as a function of S m . Galaxy distribution in 
the cosmic density field predicted by semi-analytic models of galaxy formation shows 
similar stochastic behavior. We discuss the implications of the stochasticity in halo 
bias to the modelling of higher-order moments of dark haloes and of galaxies. 
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1 INTRODUCTION 



In the current scenario of galaxy formation, galaxies are as- 



MW; |Mo et al. 1996|; |Jing et al. 199l| [Sheth fc Tormen 199~ 



sumed o form by the co oling and condensat i on of gas within 



dark matter haloes (e.g. White & Kecs 1978 f White fc Frcnk 



1991). The problem of galaxy clustering in space can there- 
fore be approached by understanding the spatial distribution 
of dark haloes and the formation of galaxies in individual 
dark haloes. This approach to the problem of galaxy spatial 
clustering is very useful because the formation and clustering 
properties of dark haloes can be modelled relatively reliably 
due to the simplicity of the physics involved (gravity only) 
and because realistic models of galaxy formation in dark 
halo es can now be construc t ed using semi-analytic models 
(e.g. [Kauffmann et al. 1999] ; |Cole et al. 200$ |Somerville & 
Primack 1999| ). Indeed, there are quite a few recent investi- 
gations attemptin g to model galax y clustering base d on the 



Governato et al. 199£ ; CoIbo7g"TrT>L 2000] ) . High-order mo- 
ments of the halo distribution have also been modeled by Mo 
et al. (1997) based on a deterministic bias relation. These 
authors showed that the model works on large scales in com- 
parison with N-body simulations. Nevertheless, the effect of 
stochasticity may be important in these high-order statis- 
tics as well as in the full distribution function of haloes. In 
fact, the non-Poissonian behavior of the bias relation is al- 
ready emphasized in the original paper of MW; in particular, 
MW poin ted out that halo-exclusi on can cause sub-Poisson 
variance. |5heth fe Lemson (1999| ) showed how the effects 
of stochasticity could be incorporated, easily and efficiently, 
into the analysis of the higher order moments. 



Recently Somerville et al. (2001) used A-body simula- 
tions to study the stochasticity and non-lineari ty of the bias 



b al. 2001 



et al. 2001) 



halo scenario (e.g. Ljlng et al. 1998|; [Via fc Fry 



-200C 



Puacuck fe Smith 200C 



Scocci- 



9eljak 2000 r3faetrj 



relation based on the formalism developed by Dekel & La- 



Based on the Press-Schechter 



Mo 



formalism ( 


Press & 


|Lacey & Co 


le 1994D, 



White (1996 ) (hereafter MW) developed a model for 
the mean bias relation for dark haloes. Their model and it s 
extension based on ellipsoidal collapse ( |5heth et al. 200l| ) 
have been extensively tested by N-body simulations (e.g. 



hav (199£). They analyzed the bias relation for haloes with 
masses larger than 1.0 x 10 12 ft -1 Mq in spherical volumes of 
radius 8 h~ 1 Mpc. Our present work is quite closely related to 
theirs but contains several distinct aspects. First of all, our 
analysis is focused on the distribution function Pv(Nh\5 m ), 
which gives the probability of finding Nh haloes in a volume 
V with mass density contrast 8 m [S m = - — 1, where p is the 
mass density and p is the mean mass density]. As we will 
show later, this function completely specifies the relation be- 
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tween the spatial distribution of haloes and that of the mass 
in a statistical sense. Second, our analysis covers a wider 
range of halo masses and a larger range of volumes for the 
counts-in-cells. Finally, we attempt to develop a theoretical 
model to describe the stochasticity of the bias relation. This 
theoretical model is based on the mean bi as relation given 
in MW and on the variance model given in Sheth & Lemson 
(19990As we will see below, the Sheth & Lemson model fails 
in high mass density regions, where gravitational clustering 
becomes important. One of the main purposes of this paper 
is to show that a simple modification of the Sheth & Lemson 
formulae for the variance allow s one to make accurat e pre- 
dictions even in dense regions. Taruya & Suto (2000) have 
proposed a model for the stochasticity in halo bias relation 
based on the formation-epoch distribution of dark haloes, 
an approach very different from ours. 

The paper is organized as follows. In Section ^| we in- 
troduce the bias relation based on the conditional probabil- 
ity and present a phenomenological model to describe the 
behavior of the variance as a function of the local density 
contrast. In Section |^ we present the numerical data used 
and study the mean and variance of the bias relation. We 
discuss and summarize our results in Section H. 



2 THE HALO-MASS BIAS RELATION 
2.1 The Concept of Bias 

In the following we shall introduce some important concepts 
we will use throughout the paper. For simplicity we intro- 
duce them for the case of dark matter haloes, without loss 
of generality. 

Let us define p as the mass density smoothed in regions 
of some given volume V . The mass density contrast (<5 m ) in 
this volume is then defined as: 

8 m = £-l, (1) 
p 

where p is the mean mass density in the universe. In the 
same way, if Nh and Nh correspond to the number of dark 
matter haloes and to the mean number of dark matter haloes 
in the volume V , respectively, the number density contrast 
of dark matter haloes (Sh) is given by 



Nh 



(2) 



The relationship between Sh and S m is widely known 
as the halo-mass bias. A general way to represent this bias 
relation is to express the halo number density contrast as a 
function of the mass density contrast 



6 h (V) = F(S m (V)). 



(3) 



Since the mass and halo number density contrasts are de- 
fined in a given volume, the bias relation defined in equation 
|§] is assumed to be a function of the local fields and thus 
called "local halo-mass bias". The exact form of the func- 
tion F(8 m (V)) depends on how the objects in consideration 
form in the cosmic density field. 

The bias function F(8 m (V)) can be of several kinds. If 
the bias function is a linear function of the local mass density 
contrast 8 m (V) then it is said that one has linear bias. On 
the other hand, if the bias function is a non-linear function 



of the local mass density contrast S m (V) then one has non- 
linear bias. A special case occurs when F(5 m (V)) = 5 m (V), 
in this case the local halo field is unbiased respect to the 
local mass density field. 

The above mentioned biasing schemes correspond to the 
case of deterministic bias. A more general description of the 
biasing process corresponds to a stochastic one, i.e, there 
is a non-zero dispersion of the bias around its mean and 
the bias relation is described in a probabilistic way. There 
are two factors which can cause such dispersion. First, for 
a given region, not only S m , but also other local properties, 
such as dumpiness and deviation from spherical symmetry, 
can affect the halo number in the region. Second, some non- 
local effects, such as the large-scale tidal field, can affect 
the formation of haloes. In general the stochasticity of the 
bias relation can be described by the conditional distribution 
function, Pv{Nh\S m ), which gives the probability of finding 
Nh haloes in a volume V with mass density contrast S m ■ The 
deterministic bias scheme is recovered when the conditional 
probability can be well approximated by a delta function. 



2.2 The conditional probability 

Dark matter haloes are formed in the cosmological density 
field due to nonlinear gravitational collapse. In general, the 
halo density field is expected to be correlated with the un- 
derlying mass density field. Thus, if we denote by S m the 
matter density fluctuations field and by Nh the field of halo 
number (where both fields are smoothed in regions of some 
given volume), Nh and 5 m are related. We refer to this re- 
lation as the halo bias relation (see last subsection). Since 
in general the halo number in a volume depends not only 
on the mean mass density but also on other properties of 
the volume, the relation between Nh and <5 m is not expected 
to be deterministic. It must be stochastic. The stochasticity 
of the bias relation can be described by the conditional dis- 
tribution function, Pv(Nh\Sm), which gives the probability 
of finding Nh haloes in a volume V with mass density con- 
trast 8 m . This conditional probability completely specifies 
the relation between the mass and halo density fields in a 
statistical sense. Indeed, once Pv{Nh\8 m ) is known, the full 
count-in-cell function Py(Nh) for haloes can be obtained 
from the mass distribution function Pv{S m ) through 



Pv(Nh) 



Pv(N h | 5 m )Pv(5 m )dS„ 



(4) 



The form of Pv(Nh\S m ) depends on how dark haloes 
form in the cosmological density field and is not known a 
priori. The simplest stochastic assumption is that it is Pois- 
sonian. This assumption is in fact used in almost all i nter- 
pretation s of the moments of galaxy counts in cells (c.f. 



Pee- 



bles 198C), where terms of Poisson shot noise are subtracted 
to obtain the correlation strength of the underlying density 
field. However, this assumption is not solidly based, and so 
it is important to examine if other assumptions on the form 
of Pv{Nh\Sm) actually work better for dark haloes. In this 
paper, we test other three models of Pv{Nh\S m ), along with 
the Poisson model. These are the Gaussian model, the Log- 
normal model, and the thermodynamical model. T he last 
model was developed by saslaw & Hamilton (1984) for the 
distribution of galaxies. 
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2.3 A Model for the Halo-Mass Bias Relation 



3 TEST BY iV-BODY SIMULATIONS 



To second order, the probability distribution function 
Pv(Nh\Sm) is described by the mean bias relation N = 
N(S m ) and the variance a 2 = (N 2 \8 m ). MW developed a 
model for the mean bias relation of haloes based on the 
spherical collapse model. Their model works well for mas - 
sive haloes and an extension of it by Sheth et al. (2001) 



based on ellipsoidal collapse may work better for low mass 
haloes. 

Sheth & Lemson (1999) have presented a model for the 
variance of the bias relation which accounts for the halo 
exclusion due to the finite size of haloes (i.e. two different 
haloes can not occupy the same volume) . They showed that 
their model was able to describe the first and second mo- 
ments of the halo distribution from scale-free N-body sim- 
ulations. Nevertheless the model is expected to fail when 
the underlying clustering makes a significant contribution 
to the variance. As an amendment, we introduce an addi- 
tional term accounting for the clustering of haloes in high 
density regions. We use this phenomenological model for the 
variance of the halo bias relation. 

Briefly, the mean of the bias relation from th e MW 



model and our p henomenological modification of the Sheth 
& Lemson (1999) formula for the variance^], are given by 



(N) = J dm N(m,5i | M,S ) 
and 

a 2 = (N(N-l)) + (N) - (TV) 2 , 
where: 



(5) 



(6) 



(N(N-l)) = J dm 1 dm 2 N(mi,Si \ M,S ) 

JV(m 2 ,tfi ! M- mi ,S'){l + Aj 2 ), (7) 

N(m,5i | M, So) denotes the average number of haloes 
of mass m identified at a given epoch 21 [with a critical 
overdensity for collapse <5i = <5 C (1 + 21)] in an uncollapsed 
spherical region of comoving volume V with mass M and 
overdensity So, and 8' is the mass density contrast of the 
fraction of the volume not occupied by the m haloes. The 
additional term (1 + A£ 2 ) in the expression for the vari- 
ance accounts for the contribution from mass clustering and 
has been constructed as the simplest function of the vari- 
ance of the mass distribution with the property of having 
high values in overdense regions and of being unity in ho- 
mogeneous regions. As we will show below, a good fit to 
the simulation data can be achieved by choosing £ 2 to be 
the second order moment of the mass distribution on the 
scale in consideration. In this case, we can write the term 
M2 = M m (zi) » AD 2 (z 1 )^ m (0), where D{z) is the linear 
growth factor normalized to one at z = 0. The constant A 
is to be calibrated by simulations. 



1 We only use the spherical model here because a consistent im- 
plementation of the ellipsoidal model into the phenomenological 
model for the variance is not straightforward. 



3.1 Numerical Data 

For this study we use the spatial distribution of dark matter 
particles as well as of dark haloes from the ACDM version of 
the high resolution GIF N-body simulations (for details see 



Kauffmann et al. 199S). These simulations have 256 3 par- 



ticles in a grid of 512 cells, with a gravitational softening 
length of 20 h _1 kpc. In the ACDM case, the simulation as- 
sumes Q = 0.3, Qa = 0.7 and h — 0.7. The initial power 
spectrum has a shape parameter F = 0.21 and is normal- 
ized so that the rms of the linear mass density in a sphere 
of radius 8/i _1 Mpc is as = 0.9. The simulation box has a 
side length L = 141 h~ Mpc, and the mass of each particle 
is M p = 1.4 x 10 10 /i _1 M Q . 

The halo catalogues have been created by the GIF 
project (Kauffmann et al. 1999) using a friends-of- friends 
group-finder algorithm to locate virialized clumps of dark 
matter particles in the simulations outputs. They used a 
linking length of 0.2 times the mean interparticle separation 
and the minimum allowed mass of a halo is 10 particles. 
In what follows, the mass of a halo is represented by the 
number of particles it contains. 

We also use the galaxy catalogues constructed from the 
same simulations. The catalogues are limited to model galax- 
ies with stellar masses greater than ~2x 10 10 /i _1 Mq. For 
further details about these catalogues and the galaxy forma- 



tion m odels used in their construction see [Kauffmann et al 
(199S| ) 



The halo conditional probability Pv(Nh | S m ) has been 
estimated for various samples and a number of sampling vol- 
umes V. For the presentation, we only use samples of haloes 
selected at redshifts z = 3, 1 and 0. Two kinds of analyses 
are performed. In the first case, halo counts-in-cells are es- 
timated at the same time as when the haloes are identified, 
while in the second case counts-in-cells are estimated at a 
given time for the central particles of the haloes identified 
at an earlier time. As a convention, we use Z\ to denote the 
redshift at which haloes are identified, while using 20 to de- 
note the redshift at which the counts in cells are estimated. 
For example, a case with 20 = and zi = 3 means that 
haloes are identified at redshift 3 while the counts-in-cells 
are estimated for their central particles at redshift 0. The 
computations of the counts in cells are performed for vol- 
umes of cubical cells with side lengths 1/32, 1/16, 1/8, and 
1/4 times the side length of the simulation box, correspond- 
ing to 4.4, 8.8, 17.6 an d 35.2 /i.~ 1 Mpc in com oving units. The 
algorithm proposed by Szapudi et al. (1999), which allows an 
accurate determination of the probability function in a rel- 
atively short time, is applied to estimate the counts-in-cells 
on a grid of 256 3 cells. The conditional probability of finding 
Nh haloes in a cell of volume V given that the local mean 
mass overdensity has a value between 5 m and S m + A5 m is 
computed from the counts-in-cells through 



Pv(N h \5 m ) 



P(N h ,8 m )A8„ 
P(S m )AS m 



(8) 



where Pv{Nh,S m ) is the joint probability for finding Nh 
haloes and a mass overdensity between 5 m and 5 m + A8 m in 
a cell of volume V, and Pv{8m) is the distribution function 
for the underlying mass density field. 
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3.2 The Conditional Probability 

Figure ^ shows the conditional probability Py (Nh\5 m ) ob- 
tained from the simulations at several representative values 
of 8 m . The halo sample shown contains all present-day haloes 
with masses greater than 10 particles. The numerical condi- 
tional probabilities are compared with the fits to Gaussian, 
Lognormal, Poisson and Thermodynamical functions. From 
the figure it can bee seen that the Poisson model is in general 
a poor description of the present time conditional probabil- 
ity measured from the simulations, and that the Gaussian 
model is overall a quite good assumption. The Lognormal 
and Thermodynamical functions are a sort of intermediate 
functions, i.e they are not as poor descriptors for the con- 
ditional probability as the Poisson function is, but, on the 
other hand, they do not describe the features of the condi- 
tional function as good as the Gaussian function does. This 
result is valid for all the halo mass ranges under analysis 
and all the scales tested. 
As discussed by 



Coles fc Frenh_{j_991 Y the widely used 



Poisson model introduced by Layzer (1956) is based on the 
assumption that the probability to find a given number of 
galaxies (or haloes) in a volume is determined only by the 
local mass density field and that the probability to find a 
galaxy (or halo) in an infinitesimal volume SV is propor- 
tional to the density at this volume element and is indepen- 
dent of the probability to find another galaxy (or halo) in 
a neighboring infinitesimal volume. Therefore, our finding 
that the conditional probability for haloes has non-Poisson 
form suggests that the deviation from Poisson of the halo- 
biasing process is due to effects related to the probability for 
a halo to have neighbors, such as volume exclusion, cluster- 
ing and large-scale environments. Notice that some of these 
effects are already includ ed in the model for the variance of 
the bias relation (section 2.3). 



3.3 The Mean and Variance of Halo Bias 

Given that the Gaussian model is a reasonable fit to the 
conditional probability function, we now concentrate on the 
mean and variance of this function, which are the two quan- 
tities needed to specify a Gaussian distribution. In order to 
show deviations from the Poisson distribution, we consider 
the mean and the ratio between the variance and the mean: 



1 + S h 



(N\S m 
nV 



{N\S„ 



(9) 



where 8h = =A — 1 is the number density contrast of haloes 
and n is their mean number density. 

Figures |^-^| show the results given by the simulations. 
Results are shown for samples in four representative mass 
ranges: a) a sample of low mass haloes, b) a sample contain- 
ing both low and high mass haloes, c) a sample of interme- 
diate mass haloes and d) a sample of high mass haloes. The 
corresponding halo masses are shown in table 1. 

One sees that the ratio variance/mean shows a Poisson- 
like behavior (i.e. ~ 1) for low values of 8 m . This ratio 
becomes sub-Poisson (i.e. < 1) at intermediate values of 
5 m , and super-Poisson (> 1) for high values of S m . The ex- 
act change of the variance/mean ratio with 5 m depends on 
halo mass: the sub-Poisson variance extends to higher val- 
ues of S m for samples with higher halo masses. The volume- 



exclusion effect is reduced for the descendants of haloes iden- 
tified at an earlier epoch and the variance/mean ratio ap- 
proaches the Poisson value for the descendants of haloes 
selected at early times (see Figure rt) . 



The curves in Figures 



show model predictions. The 



mean bias relations given by the simulations are well de- 
scribed by the model of MW, confirming earlier results. The 
behavior of the variance/mean ratio is also reasonably well 
reproduced, when the constant A in equation (^) is chosen to 
be 0.05 (as given by the fit to the bias relation for present- 
day haloes in the simulation). Thus, sub-Poisson variance 
can be caused by halo exclusion while the super-Poisson 
variance at high <5 m may be explained by the clustering of 
mass at the time of halo identification. The model for the 
variance begins to fail at very high values of 8 m . But since 
cells with such high densities are only a tiny fraction of all 
cells, this failure is not important. 



3.4 Bias Relation for Model Galaxies 

We have also estimated the mean and variance of the bias 
relation between model galaxies, from the GIF simulations, 
with stellar masses greater than ~2x 10 10 Mq and the 
underlying mass density, with the results shown in Figure 
H Interestingly, the variance/mean ratio in the galaxy- mass 
bias relation also exhibits significant sub-Poissonian behav- 
ior, implying that the effect of volume exclusion is also im- 
portant for the spatial distribution of galaxies. One possible 
reason for this is that many of the galaxy-sized haloes may 
host only one galaxy and the galaxy distribution inherits a 
considerable fraction of the exclusion effects from the distri- 
bution of their host haloes. 

If this is true for real galaxies, it has important implica- 
tions for the interpretations of galaxy clustering, as we will 
see in Section ^| 



3.5 The Count-in-Cell Function of Dark Haloes 

As an additional test for the bias model, we use the sim- 
ulation result for Pv{8 m ) and the theoretical model for 
Pv{Nh\6m) [i-e. a Gaussian conditional probability function 
with the mean and variance given by equations (2)-(4)] to 
reconstruct the counts-in-cells functions for haloes by using 
equation (Q) . As we only want to test the model of the bias 
relation, we do not use theoretical models for Py(8 m ), al - 



though such models do exist [e.g. the model of Sheth (199£) 
based o n excursion set approa ch, and the Lognormal model 
used in Coles & Jones (1991)]. Since the probability func- 



tions obtained from the simulations are quite noisy at very 
high values of 8 m and the model predictions in this regime 
may fail, we truncate our computations at a given high value 
of 8 m [8™ ax = 10 at the scales I = 4.4 and I = 8.8 h' 1 Mpc, 
and SZ ax = 3 at I = 17.6 h' 1 Mpc], which correspond to 
the low-probability tail of the mass probability function, as 
can be seen clearly in the lower panel in figure (2) . For com- 
parison we also reconstruct the halo count-in-cell functions 
using a Poissonian form for the conditional function, with 
the mean given by equation (2). In Figure ([]) we compare 
the reconstructed halo count-in-cell functions for present- 
day haloes containing more than 10 particles with the cor- 
responding functions obtained directly from the simulations. 
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Figure 1. Comparison between the conditional probability measured from the simulations (squares) for present epoch haloes with 
masses greater than 10 particles and the corresponding best fits of the Poisson (dash-dot-dot-dot line), Thermodynamical (dashed line), 
Lognormal (dash-dot line) and Gaussian (solid line) distribution functions. The rows correspond, from top to bottom, to the sampling 
scales I = 4.4, 8.8, 17.6, 35.2 Mpc/h, respectively. For each sampling scale there are four plots corresponding to the local mass overdensity 
as indicated in the labels. 



Clearly, the model matches the simulation results remark- 
ably well. The halo count-in-cell functions reconstructed us- 
ing a Poissonian form for the conditional probability func- 
tion depart from the corresponding numerical values in the 
low-probability, high density tail. 

The halo count-in-cell functions obtained through this 
approach can be used to calculate the high-order moments, 
such as skewness and kurtosis, of halo distributions. This 



application will be presented in detail in a forthcoming pa- 
per. 



4 DISCUSSION AND SUMMARY 

In this paper, we have analyzed in detail the conditional 
probability function Pv(N\5 m ) to understand the stochastic 
nature of halo bias. We have found that in high-resolution 
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Sample 


21 = 


21 = 1 


21 = 3 


a) 


M h = 20-30 particles 
M h /M* = 0.03-0.04 


M h = 20-30 particles 
M h /M* = 0.62-0.92 


M h = 20-30 particles 
M h /M* = 100-150 


b) 


M h = 20-2000 particles 
M h /AL = 0.03-2.85 


M h = 20-2000 particles 
M h /M* = 0.62-61.5 


M h = 20-600 particles 
M h /hU = 100-3000 


c) 


M h = 200-800 particles 
M h /NL = 0.28-1.14 


M h = 200-800 particles 
M h /M* = 6.15-24.6 


M h = 50-100 
M h /NL = 250-500 


d) 


Mh > 800 particles 
Mh/M* > 1.14 


M h > 800 particles 
M h /M+ > 24.6 


M h > 200 
M h /A'h > 1000 



Table 1. Ranges of halo masses corresponding to the samples shown in figures pj-^. a) sample of low mass haloes, b) sample containing 
both low and high mass haloes, c) sample of intermediate mass haloes and d) sample of high mass haloes. is defined by <r(M*) = 1.68. 



iV-body simulations this function is well represented by a 
Gaussian model, and that a Poisson model is generally a 
poor approximation. That means that the galaxy biasing 
process, as well as the halo biasing process, is not only de- 
termined by the local value of the mass density field, but 
also by other local quantities, such as dumpiness, and by 
non-local properties, such as large-scale tidal field. 

We have shown that a simple, phenomenological model 
can be constructed for Pv(N\5 m )- This allows one to con- 
struct a theoretical model for the full count-in-cell function 
for dark haloes. The galaxy distribution in the cosmic den- 
sity field predicted by semi-analytic models of galaxy forma- 
tion shows similar stochastic behavior to that of the haloes, 
implying that galaxy distribution is not a Poisson sampling 
of the underlying density field. 

These results have important implications in the inter- 
pretations of galaxy clustering in terms of the underlying 
density field. For example, the quantity conventionally used 
to characterize the second moment of counts-in-cells is de- 
fined (here for dark halo) as 



((N~nV) 2 ) 



(10) 



{nV) 2 (nV) ' 

where the second term on the right-h and side is to subtract 
Poisson shot noise (e.g. Peebles 198C ). With the use of equa- 
tion (^), it is easy to show that 

k 2 (R) = —^[{N\8 m ) 2 P v (8 m )dS m 



(nV) 

+ 7-^5 J [<r 2 - (N\5 m )] Pv(S m ) d5 m -l. (11) 

Thus, even if haloes trace mass on average, i.e. {N\S m ) oc 5 m , 
this quantity is not equal to the second moment for the mass, 
because the second term on the right-hand side is gener- 
ally non-zero. Furthermore, the non-Poissonian behavior of 
the bias relation might imply that the (Poisson) shot-noise 
corrections usually applied at estimating higher-order mo- 
ments of the galaxy distribution are not completely correct 
and therefore interpretations of skewness and kurtosis might 
change considerably, at least at the scales where shot-noise 
terms are not too small. This issue needs to be investigated 
in more detail. Thus, in order to infer the properties of the 
mass distribution in the Universe from statistical measures 
of the galaxy distribution, it is necessary to understand the 

stochastic nature of galaxy biasing. 

As discussed in Dekel fc Lahav (1999 ), the stochastic- 



ity in galaxy biasing not only affects the interpretation of 
the moments of the galaxy distribution, but also affects the 
interpretation of other statistical measures of galaxy cluster- 
ing, such as redshift distortions, the cosmic virial theorem 
and the cosmic energy equation. With the results shown in 
the present paper, one can model quantitatively many of 
these effects. 
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Figure 3. Same results as shown in figure (H) but for haloes identified and analyzed at redshift z = 1. 
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Figure 4. The same as in figure (Eh but for the present epoch descendants of haloes already formed at z = 3. 
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Figure 5. Mean bias relation and ratio between the variance 
and the mean of the bias relation of galaxies obtained from the 
simulations using semi-analytical models of galaxy formation. We 
show model galaxies at the present epoch (upper panel) and at 
redshift 1 (lower panel) . The mean and ratio between the variance 
and the mean of the bias relation are shown in the top and bottom 
rows in each panel, respectively. At each epoch the cubical cells 
of side length, from left to right, I = 4.4, 8.8, 17.6 h~ 1 Mpc are 
shown. 



Figure 6. Halo count-in-cell functions for a sample of present 
day haloes with masses greater than 10 particles. The circles cor- 
respond to the probability function obtained from the simula- 
tions and the lines to the semi-analytically reconstructed count- 
in-cell function using spherical collapse approach. The solid and 
dashed lines show the reconstructed functions using a Gaussian 
and a Poissonian form for the conditional probability function, 
respectively. The open circles show the simulated halo count-in- 
cell functions obtained from the complete mass and conditional 
probability functions, while the filled circles show the simulated 
mass count-in-cell functions obtained from the mass and condi- 
tional probability functions truncated at high values of the mass 
density contrast. The boxes correspond, from top to bottom, to 
the scales I = 4.4, 8.8, 17.6 h" 1 Mpc. 



